# Real-time Transcription

Faster Distil Whisper Large V3.5
MIT
Distil-Whisper is a distilled version of the Whisper model, optimized for Automatic Speech Recognition (ASR) tasks, offering faster inference speeds.
Speech Recognition English
F
Purfview
565
2
Faster Distil Whisper Large V3.5
MIT
A CTranslate2 format model converted from Distil-Whisper large-v3.5 for efficient speech recognition
Speech Recognition English
F
deepdml
58.15k
2
Whisper Large V3 Turbo Gguf
MIT
Whisper large-v3-turbo is a pruned and fine-tuned version based on Whisper large-v3, with the decoder layers reduced from 32 to 4, significantly improving speed while slightly reducing quality.
Speech Recognition Supports Multiple Languages
W
xkeyC
546
1
Distil Large V3.5 Ct2
MIT
Distil-Whisper is a distilled version of the Whisper model, achieving efficient speech recognition through large-scale pseudo-labeling technology
Speech Recognition English
D
distil-whisper
264
3
Lite Whisper Large V3 Turbo Acc
Apache-2.0
Lite-Whisper is a lightweight version of OpenAI Whisper compressed using LiteASR technology, maintaining high accuracy while reducing model size.
Speech Recognition Transformers
L
efficient-speech
7,414
7
Moonshine Base ONNX
MIT
ONNX-format automatic speech recognition model based on the Moonshine base model, supporting efficient inference
Speech Recognition Transformers
M
onnx-community
1,171
29
Moonshine Tiny ONNX
MIT
Moonshine Tiny is a lightweight automatic speech recognition (ASR) model suitable for embedded devices and edge computing scenarios.
Speech Recognition Transformers
M
onnx-community
60
6
Moonshine Base
MIT
Moonshine is a series of automatic speech recognition (ASR) models developed by Useful Sensors, specifically designed for English speech transcription, excelling on resource-constrained platforms.
Speech Recognition Transformers English
M
UsefulSensors
6,857
32
Whisper Tiny Chinese
Apache-2.0
A speech recognition model fine-tuned on the Common Voice 11.0 Chinese dataset based on OpenAI Whisper Tiny model
Speech Recognition Transformers Chinese
W
jethrowang
99
1
Whisper Base.en
Whisper is a general-purpose speech recognition model trained by OpenAI. This model is based on large-scale weakly supervised training and supports speech transcription in multiple languages.
Speech Recognition Transformers
W
onnx-community
76
1
Whisper Base
Whisper is an automatic speech recognition (ASR) system trained by OpenAI, supporting multilingual speech transcription.
Speech Recognition Transformers
W
onnx-community
5,704
19
Faster Distil Whisper Large V3
MIT
Distilled version of Whisper Large v3 for efficient automatic speech recognition (ASR)
Speech Recognition English
F
Systran
18.55k
49
Nue Asr
Apache-2.0
Nue ASR is an end-to-end Japanese speech recognition model that integrates pre-trained speech and language models, offering high accuracy and fast recognition speed.
Speech Recognition Transformers Supports Multiple Languages
N
rinna
722
24
Distil Medium.en
MIT
Distil-Whisper is a distilled version of the Whisper model, 6 times faster than the original, with a 49% reduction in size, while maintaining performance close to the original in English speech recognition tasks.
Speech Recognition English
D
distil-whisper
186.85k
120
Whisper Small Ml
Apache-2.0
This model is a fine-tuned version of openai/whisper-small for speech recognition, supporting multiple languages and suitable for automatic speech recognition tasks.
Speech Recognition Transformers
W
kavyamanohar
23
2
Whisper Small Turkish Tr Best
Apache-2.0
Turkish speech recognition model fine-tuned based on OpenAI Whisper-small, with a word error rate of 26.34%
Speech Recognition Transformers
W
erenfazlioglu
61
4
Whisper Medium
Whisper Medium is a medium-scale speech recognition model developed by OpenAI, supporting automatic speech recognition (ASR) tasks in multiple languages.
Speech Recognition Transformers
W
Xenova
871
4
Whisper Small
Whisper Small is a small automatic speech recognition (ASR) model developed by OpenAI, capable of converting speech into text.
Speech Recognition Transformers
W
Xenova
1,716
9
Whisper Base
Whisper is an automatic speech recognition (ASR) system trained by OpenAI, supporting speech-to-text tasks in multiple languages.
Speech Recognition Transformers
W
Xenova
6,204
7
Faster Whisper Small
MIT
Transformer-based automatic speech recognition (ASR) model supporting multilingual transcription
Speech Recognition Supports Multiple Languages
F
guillaumekln
4,599
15
Wav2vec2 Live Japanese
Apache-2.0
A Japanese speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, supporting hiragana output
Speech Recognition Transformers Japanese
W
ttop324
20
4
Waynehills STT Doogie Server
Apache-2.0
A fine-tuned speech recognition model based on Doogie/Waynehills-STT-doogie-server
Speech Recognition Transformers
W
Waynehillsdev
28
0
Distil Wav2vec2
Apache-2.0
Distil-wav2vec2 is a distilled version of the wav2vec2 model, with a 45% reduction in size and a two-fold increase in inference speed, suitable for automatic speech recognition tasks.
Speech Recognition Transformers English
D
OthmaneJ
854
11
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase